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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: PLANT BIOSCIENCE LIMITED 

(B) STREET: COLNEY LANE 

(C) CITY: NORWICH 

(D) STATE: NORFOLK 

(E) COUNTRY: UK 

(F) POSTAL CODE (ZIP) : NR4 7UH 
(ii) TITLE OF INVENTION: PROTEIN 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: PatentIn Release #1.0, 



(2) INFORMATION FOR SEQ ID NO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



'sion #1.25 (EPO) 




(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 18 . . 9 



(xi) SEQUENCE DESCRII 



^ON: SEQ ID NO 



CCAAATCCCA TCTCAAA ATG ;y(T GAT CTA TTT TCA GGA TCT TTC TCT CGT 
Met jLsn Asp Leu Phe Ser Gly Ser Phe Ser Arg 
1/ 5 10 

TTC AGA GCT GAC GAT jCAA TCG GAC TCT CAC GCC ATA GAA ATG GGA GAC 
Phe Arg Ala Asp Asn/oin Ser Asp Ser His Ala lie Glu Met Gly Asp 
15 / 20 25 

ATT ACT GGC GGAybTC AAT CTC GAC AAA TTC TTC GAA GAT GTT GAA GCC 
He Thr Gly Gly Val Asn Leu Asp Lys Phe Phe Glu Asp Val Glu Ala 
30 / 35 40 

ATT AAA GAO^GAA CTC AAA GGC CTC GAG AAA ATC TAT TCC CAA CTC CAA 
He Lys Asp Glu Leu Lys Gly Leu Glu Lys He Tyr Ser Gin Leu Gin 
45 / 50 55 

TCT rc^ CAT GAA AAA AGC AAG ACT CTT CAC AAC GCT AAA GCC GTT AAA 
Ser SdT His Glu Lys Ser Lys Thr Leu His Asn Ala Lys Ala Val Lys 
60 / 65 70 75 

GAy CTA AGA TCC AAC ATG GAT AAT GAC GTT TCC ATG GCA TTG AAG AAA 
fp Leu Arg Ser Asn Met Asp Asn Asp Val Ser Met Ala Leu Lys Lys 
60 85 90 

GCC AAA TTC ATC AAA GTT CGT CTC GAA GCC TTA GAC AGA TCA AAT GCA 
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Ala Lys Phe lie Lys Val Arg Leu Glu Ala Leu Asp Arg Ser Asn Ala 
95 100 105 

GCG AAT CGA AGC CTC CCT GGA TGT GGA CCC GGA AGT TCA TCT GAC AGG 

Ala Asn Arg Ser Leu Pro Gly Cys Gly Pro Gly Ser Ser Ser Asp Arg 
110 115 120 



ACG AGA ACT TCA GTT GTG AAC GGA TTA AGG AAG AAA CTT CAA GAG TCA 
Thr Arg Thr Ser Val Val Asn Gly Leu Arg Lys Lys Leu Gin Glu Ser 
125 130 135 



ATG AAT CAG TTC AAC GAG CTA AGG CAA AAG ATG GCA TCT GAA TAT AGG 
Met Asn Gin Phe Asn Glu Leu Arg Gin Lys Met Ala Ser Glu Tyr Arg 
140 145 150 155 



GAA ACA GTT CAA CGA CGA TAT TAT ACC GTC ACA GGA GAA AAT CCT GAT 
Glu Thr Val Gin Arg Arg Tyr Tyr Thr Val Thr Gly Glu Asn Pro Asp 
160 165 170 



GAA GCA GTT CTT GAT ACA CTC ATA TCT ACA GGT CAA AGT GAG ACG TTC 
Glu Ala Val Leu Asp Thr Leu lie Ser Thr Gly Gin Ser Glu Thr Phe 
175 180 185 



TTG CAA AAG GCA ATT CAA GAG CAA GGG AGA GGA CAA GTG ATG GAT ACA 
Leu Gin Lys Ala lie Gin Glu Gin Gly Arg Gly Gin Val Met Asp Thr 
190 195 200 



GTT ATG GAA ATT CAA GAA AGG CAT GAA GCT GTG AAG GAA TTG GAG AGG 
Val Met Glu lie Gin Glu Arg His Glu Ala Val Lys Glu Leu Glu Arg 
205 210 215 



AAT TTG AAA GAA TTG CAT CAA GTA TTC TTG GAC ATG GCT GTT TTG GTT 
Asn Leu Lys Glu Leu His Gin Val Phe Leu Asp Met Ala Val Leu Val 
220 225 230 235 



GAA AGT CAA GGA GCT CAA CTT GAT GAT ATT GAG AGC CAA GTG AAT AGG 

Glu Ser Gin Gly Ala Gin Leu Asp Asp lie Glu Ser Gin Val Asn Arg 

240 245 250 

GCT AAT TCC TTC GTT AGA GGG GGT GCT CAG CAA CTG CAA GTG GCA AGG 

Ala Asn Ser Phe Val Arg Gly Gly Ala Gin Gin Leu Gin Val Ala Arg 
255 260 265 



AAG CAC CAG AAG AAC ACT AGA AAA TGG ACT TGT TTT GCT ATT ATT CTT 
Lys His Gin Lys Asn Thr Arg Lys Trp Thr Cys Phe Ala lie He Leu 
270 275 280 



CTG CTT ATC ATC ATT TTG GTG GTG GTT CTT TCT ATT CAG CCA TGG AAA 
Leu Leu He lie He Leu Val Val Val Leu Ser He Gin Pro Trp Lys 
285 290 295 

AAA TGAGAATTTG TCTATGGTCA AAGGTCTTCT GGTGGACCCC TTCAATGTTT 

Lys 

300 

TGAATATTCT AAATTTTTAT ATTTTATTAT TTTAGCCATG CTTATTATTT TGTGTTATTT 
TGGATTTTTT TTTTGTTTTT AATGTGGGGA AGAGTAAACT GGATGGGGGT CCATGTGCTA 
TTTAGAGAAA TACTTGGGAG TTCTCTTTTT GTAATTATTG CTGTATTTAG AGTATAATTC 
TTTTTCTATA TTGTTGGCAG GTTAATTTGT TTGTTTGATT ATATTCTCAT TTAGATTT 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
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386 

434 

482 

530 

578 

626 

674 

722 

770 

818 

866 

914 

967 

1027 
1087 
1147 
1205 
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(A) LENGTH: 3 00 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

* Met Asn Asp Leu Phe Ser Gly Ser Phe Ser Arg Phe Arg Ala Asp Asp 

15 10 15 

Gin Ser Asp Ser His Ala lie Glu Met Gly Asp lie Thr Gly Gly Val 
20 25 30 

Asn Leu Asp Lys Phe Phe Glu Asp Val Glu Ala lie Lys Asp Glu Leu 
35 40 45 

Lys Gly Leu Glu Lys lie Tyr Ser Gin Leu Gin Ser Ser His Glu Lys 
SO • 55 60 

Ser Lys Thr Leu His Asn Ala Lys Ala Val Lys Asp Leu Arg Ser Asn 
65 70 75 80 

if] Met Asp Asn Asp Val Ser Met Ala Leu Lys Lys Ala Lys Phe lie Lys 

ffl 85 90 95 

?=J Val Arg Leu Glu Ala Leu Asp Arg Ser Asn Ala Ala Asn Arg Ser Leu 

ill ^00 105 110 

- Pro Gly Cys Gly Pro Gly Ser Ser Ser Asp Arg Thr Arg Thr Ser Val 

W 115 120 125 

Val Asn Gly Leu Arg Lys Lys Leu Gin Glu Ser Met Asn Gin Phe Asn 
130 13 5 14 0 

111 Glu Leu Arg Gin Lys Met Ala Ser Glu Tyr Arg Glu Thr Val Gin Arg 

|1J 145 150 155 160 

'r' Arg Tyr Tyr Thr Val Thr Gly Glu Asn Pro Asp Glu Ala Val Leu Asp 

Cl 165 170 175 

E_i 

Thr Leu lie Ser Thr Gly Gin Ser Glu Thr Phe Leu Gin Lys Ala lie 
180 185 190 

Gin Glu Gin Gly Arg Gly Gin Val Met Asp Thr Val Met Glu lie Gin 
195 200 205 

Glu Arg His Glu Ala Val Lys Glu Leu Glu Arg Asn Leu Lys Glu Leu 
210 215 220 

His Gin Val Phe Leu Asp Met Ala Val Leu Val Glu Ser Gin Gly Ala 
225 230 235 240 

Gin Leu Asp Asp lie Glu Ser Gin Val Asn Arg Ala Asn Ser Phe Val 
245 250 255 

Arg Gly Gly Ala Gin Gin Leu Gin Val Ala Arg Lys His Gin Lys Asn 
260 265 270 

Thr Arg Lys Trp Thr Cys Phe Ala lie lie Leu Leu Leu lie lie He 
275 280 285 

Leu Val Val Val Leu Ser He Gin Pro Trp Lys Lys 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 3: 
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(i) SEOUKNCE CHARACTERISTICS: 

(A) LENGTH: 13 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

{A) NAME/KEY: CDS 

(B) LOCATION: 77.. 991 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCCTCG AGCTACGTCA GGGATTCATT CCGATCTGAA ATCTCTCTCT AGATTTCTCT 60 

ATTTTTCGAA T TT TAA ATG AAC GAT TTG TTT TCC AGC TCA TTC TCT CGC 109 
Met Asn Asp Leu Phe Ser Ser Ser Phe Ser Arg 
15 10 

TTC CGC AGC GGA GAA CCA TCC CCT CGC CGA GAC GTT GCC GGC GGT GGC 157 
Phe Arg Ser Gly Glu Pro Ser Pro Arg Arg Asp Val Ala Gly Gly Gly 
15 20 25 

GAC GGA GTT CAG ATG GCG AAT CCC GCG GGA TCA ACC GGT GGT GTG AAC 205 
Asp Gly Val Gin Met Ala Asn Pro Ala Gly Ser Thr Gly Gly Val Asn 
30 35 40 

CTC GAC AAG TTC TTC GAA GAT GTT GAA TCT GTG AAA GAA GAG CTA AAG 253 
Leu Asp Lys Phe Phe Glu Asp Val Glu Ser Val Lys Glu Glu Leu Lys 
45 50 55 

GAG CTA GAT CGG CTC AAC GAA ACA CTC TCT TCA TGT CAC GAG CAG AGC 3 01 

Glu Leu Asp Arg Leu Asn Glu Thr Leu Ser Ser Cys His Glu Gin Ser 
60 65 70 75 

AAG ACG CTT CAC AAT GCT AAA GCC GTT AAA GAT CTC CGG TCT AAA ATG 34 9 

Lys Thr Leu His Asn Ala Lys Ala Val Lys Asp Leu Arg Ser Lys Met 
80 85 90 

GAC GGT GAC GTT GGA GTC GCG TTG AAG AAG GCG AAG ATG ATT AAA GTT 3 97 

Asp Gly Asp Val Gly Val Ala Leu Lys Lys Ala Lys Met lie Lys Val 
95 100 105 

AAA CTC GAG GCG CTA GAT CGT GCC AAT GCT GCT AAT CGG AGT CTC CCT 44 5 

Lys Leu Glu Ala Leu Asp Arg Ala Asn Ala Ala Asn Arg Ser Leu Pro 
110 . 115 120 

GGC TGT GGA CCT GGT TCT TCC TCC GAT CGA ACC AGG ACC TCT GTC CTC 4 93 

Gly Cys Gly Pro Gly Ser Ser Ser Asp Arg Thr Arg Thr Ser Val Leu 
125 130 135 

AAT GGT CTC AGG AAG AAA TTG ATG GAC TCT ATG GAT AGT TTC AAC CGA 541 
Asn Gly Leu Arg Lys Lys Leu Met Asp Ser Met Asp Ser Phe Asn Arg 
140 145 150 155 

TTG AGG GAG CTT ATC TCG TCC GAG TAT AGA GAA ACT GTA CAG AGG AGG 58 9 

Leu Arg Glu Leu lie Ser Ser Glu Tyr Arg Glu Thr Val Gin Arg Arg 
160 165 170 

TAC TTC ACC GTC ACC GGC GAG AAT CCG GAT GAA CGA ACC CTA GAT CGA 63 7 

Tyr Phe Thr Val Thr Gly Glu Asn Pro Asp Glu Arg Thr Leu Asp Arg 
175 180 185 

CTG ATT TCC ACT GGA GAG AGT GAG AGA TTC TTG CAG AAA GCA ATA CAA 685 
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Leu lie Ser Thr Gly Glu Ser Glu Arg Phe Leu Gin Lys Ala lie Gin 
190 195 200 

GAA CAA GGA AGA GGA AGG GTG TTA GAC ACC ATT AAC GAG ATT CAA GAA 73 3 

Glu Gin Gly Arg Gly Arg Val Leu Asp Thr lie Asn Glu lie Gin Glu 
205 210 215 

AGG CAT GAT GCG GTT AAA GAC ATT GAG AAG AAT CTC AGG GAG CTT CAC 781 
Arg His Asp Ala Val Lys Asp lie Glu Lys Asn Leu Arg Glu Leu His 
220 225 230 235 

CAG GTG TTT CTA GAC ATG GCC GTG CTG GTA GAG CAC CAG GGA GCT CAG 829 
Gin Val Phe Leu Asp Met Ala Val Leu Val Glu His Gin Gly Ala Gin 
240 245 250 

CTT GAT GAC ATC GAG AGT CAT GTG GGT CGA GCT AGC TCC TTT ATC AGA 877 
Leu Asp Asp lie Glu Ser His Val Gly Arg Ala Ser Ser Phe lie Arg 
255 260 265 

GGC GGA ACT GAC CAG CTA CAA ACC GCT CGG GTT TAC CAG AAG AAC ACG 925 
Gly Gly Thr Asp Gin Leu Gin Thr Ala Arg Val Tyr Gin Lys Asn Thr 
270 275 280 

CGA AAA TGG ACA TGT ATT GCC ATT ATT ATT CTC ATC ATC ATC ATA ACT 973 

111 Arg Lys Trp Thr Cys lie Ala lie lie lie Leu lie lie lie lie Thr 

Q 285 290 295 

J-J GTT GTG GTT CTT GCT GTT TTAAAACCGT GGAACAACAG CAGTGGCGGC 1021 

''^1 Val Val Val Leu Ala Val 

III 300 305 

GGCGGCGGTG GTGGTGGTGG GGGTACCACT GGAGGAAGTC AACCAAATTC AGGGACACCA 1081 

C| CCAAATCCTC CTCAGGCAAG GCGTCTATTG CGTTGAAGTT GAAGTTGAAG TTGAGTTTCG 1141 

JjJ TTATTTGCAT ATATATTCTT TCTTTGAAAA ACCTTATTAT CAAACCAGCT TTGTGTTACT 1201 

J= ACTTTCTACT GCTGGTTTGT TGTTAATCTC CCGTTTATTT GGTTTTTGTG AAAGAATTTA 1261 

a 

AAATGTGGGT TAGATGAGAA AATTAGTACA ACATTCTCTT GTATCTATGT TTGCTACCCT 1321 
GACGTAGCTC GAG 1334 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asn Asp Leu Phe Ser Ser Ser Phe Ser Arg Phe Arg Ser Gly Glu 
15 10 IS 

Pro Ser Pro Arg Arg Asp Val Ala Gly Gly Gly Asp Gly Val Gin Met 
20 25 30 

Ala Asn Pro Ala Gly Ser Thr Gly Gly Val Asn Leu Asp Lys Phe Phe 
35 40 45 

Glu Asp Val Glu Ser Val Lys Glu Glu Leu Lys Glu Leu Asp Arg Leu 
50 55 60 
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Asn Glu Thr Leu Ser Ser Cys His Glu Gin Ser Lys Thr Leu His Asn 
65 70 75 80 

Ala Lys Ala Val Lys Asp Leu Arg Ser Lys Met Asp Gly Asp Val Gly 
85 90 95 

Val Ala Leu Lys Lys Ala Lys Met lie Lys Val Lys Leu Glu Ala Leu 
100 105 110 

Asp Arg Ala Asn Ala Ala Asn Arg Ser Leu Pro Gly Cys Gly Pro Gly 
115 120 125 

Ser Ser Ser Asp Arg Thr Arg Thr Ser Val Leu Asn Gly Leu Arg Lys 
130 135 140 

Lys Leu Met Asp Ser Met Asp Ser Phe Asn Arg Leu Arg Glu Leu lie 
145 ISO 155 160 

Ser Ser Glu Tyr Arg Glu Thr Val Gin Arg Arg Tyr Phe Thr Val Thr 
165 170 175 

Gly Glu Asn Pro Asp Glu Arg Thr Leu Asp Arg Leu lie Ser Thr Gly 
180 185 190 

Glu Ser Glu Arg Phe Leu Gin Lys Ala lie Gin Glu Gin Gly Arg Gly 
195 200 205 

Arg Val Leu Asp Thr He Asn Glu He Gin Glu Arg His Asp Ala Val 
210 215 220 

Lys Asp He Glu Lys Asn Leu' Arg Glu Leu His Gin Val Phe Leu Asp 
225 230 235 240 

Met Ala Val Leu Val Glu His Gin Gly Ala Gin Leu Asp Asp lie Glu 
245 250 255 

Ser His Val Gly Arg Ala Ser Ser Phe He Arg Gly Gly Thr Asp Gin 
260 265 270 

Leu Gin Thr Ala Arg Val Tyr Gin Lys Asn Thr Arg Lys Trp Thr Cys 
275 280 285 

He Ala He He He Leu He He He He Thr Val Val Val Leu Ala 
290 295 300 

Val 
305 
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SEQ. ID. No. 5 

TTTAGATTTACTCTTATATTAGTTTGTTTGTTTAATTGGACGGTTGTTATATCTTTTTCTTA 

ATATGAGATTTATGTCGTTATTAATGTTTTTCTCTTGAGGGTTCATAAAGAGATTTATCGTG 

TACCTGGGGGTAGGTCAAATGAGAAGGGGTGTAATTTTTGTTTTTTTTTTAGGTTTTATTGT 

GTTTTATTATTCGTACCGATTTTATTATTTTATATTTTTAAATCTTATAAGTTTTGTAACTT 

CCCCAGGTGGTCTTCTGGAAACTGGTATCTGTTTAAGAGTAAAAAAGGTACCGACTTATCTT 

TCTTGGTGGTGGTTTTACTACTATTCGTCTTCTTATTATCGTTTTGTTCAGGTAAAAGATCA 

CAAGAAGACCACGAAGGAACGGTGAACGTCAACGACTCGTGGGGGAGATTGCTTCCTTAATC 

GGGATAAGTGAACCGAGAGTTATAGTAGTTCAACTCGAGGAACTGT^AAGTTGGTTTTGTCGG 

TACAGGTTCTTATGAACTACGTTAAGAAAGTTTAAGGAGAGGTTAAGGAAGTGTCGAAGTAC 

GGAAAGAACTTAAAGGTATTGACATAGGTAGTGAACAGGAGAGGGAACGAGAACTTAACGGA 

AAACGTTCTTGCAGAGTGAAACTGGACATCTATACTCACATAGTTCTTGACGAAGTAGTCCT 

AAAAGAGGACACTGCCATATTATAGCAGCAACTTGACAAAGGGATATAAGTCTACGGTAGAA 

AACGGAATCGAGCAACTTGACTAAGTAACTGAGAACTTCAAAGAAGGAATTAGGCAAGTGTT 

GACTTCAAGAGCAGGACAGTCTACTTGAAGGCCCAGGTGTAGGTCCCTCCGAAGCTAAGCGA 

CGTAAACTAGACAGATTCCGAAGCTCTGCTTGAAACTACTTAAACCGAAAGAAGTTACGGTA 

CCTTTGCAGTAATAGGTACAACCTAGAATCTAGAAATTGCCGAAATCGCAACACTTCTCAGA 

ACGAAAAAAGTACCCTTCTAACCTCAACCCTTATCTAAAAGAGCTCCGGAAACTCAAGCAGA 

AATTACCGAAGTTGTAGAAGCTTCTTAAACAGCTCTAACTGAGGCGGTCATTACAGAGGGTA 

AAGATACCGCACTCTCAGGCTAACTAGCAGTCGAGACTTTGCTCTCTTTCTAGGACTTTTAT 

CTAGTAAGTAAAACTCTACCCTAAACC 
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